An EM-based Multi-Step Piecewise Surface Regression Learning Algorithm

نویسندگان

  • Juan Luo
  • Alexander Brodsky
چکیده

A multi-step Expectation Maximization based (EM-based) algorithm is proposed to solve piecewise surface regression problem which has typical applications in market segmentation research, identification of consumer behavior patterns, weather patterns in meteorological research, and so on. The multiple steps involved are local regression on each data point of the training data set and a small set of its closest neighbors, clustering on the feature vector space formed from the local regression, regression learning for each individual surface, and classification to determine the boundaries for each individual surface. An EM-based iteration process is introduced in the regression learning phase to improve the learning outcome. The reassignment of cluster identifier for every data point in the training set is determined by predictive performance of each submodel. Cross validation technique is applied to the scenario in which the number of piecewise surfaces is not given in advance. A few clustering quality validity indexes such as Silhouette index and Davis-Bouldin index are adopted to estimate the number of piecewise surfaces as well. A set of experiments based on both artificial generated and benchmarks data source are conducted to compare the proposed algorithm and a few widely-used regression learning packages to show that the proposed algorithm outperforms those packages in terms of root mean squared errors (RMSE) of test data set.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An EM-based Ensemble Learning Algorithm on Piecewise Surface Regression Problem

A multi-step Expectation-Maximization based (EM-based) algorithm is proposed to solve the piecewise surface regression problem which has typical applications in market segmentation research, identification of consumer behavior patterns, weather patterns in meteorological research, and so on. The multiple steps involved are local regression on each data point of the training data set and a small...

متن کامل

Piecewise Surface Regression Modeling in Intelligent Decision Guidance System

An intelligent decision guidance system which is composed of data collection, learning, optimization, and prediction is proposed in the paper. Built on the traditional relational database management system, the regression learning ability is incorporated. The Expectation Maximization Multi-Step Piecewise Surface Regression Learning (EMMPSR) algorithm is proposed to solve piecewise surface regre...

متن کامل

A Surface Water Evaporation Estimation Model Using Bayesian Belief Networks with an Application to the Persian Gulf

Evaporation phenomena is a effective climate component on water resources management and has special importance in agriculture. In this paper, Bayesian belief networks (BBNs) as a non-linear modeling technique provide an evaporation estimation  method under uncertainty. As a case study, we estimated the surface water evaporation of the Persian Gulf and worked with a dataset of observations ...

متن کامل

A Surface Water Evaporation Estimation Model Using Bayesian Belief Networks with an Application to the Persian Gulf

Evaporation phenomena is a effective climate component on water resources management and has special importance in agriculture. In this paper, Bayesian belief networks (BBNs) as a non-linear modeling technique provide an evaporation estimation  method under uncertainty. As a case study, we estimated the surface water evaporation of the Persian Gulf and worked with a dataset of observations ...

متن کامل

An integrated heuristic method based on piecewise regression and cluster analysis for fluctuation data (A case study on health-care: Psoriasis patients)

Trend forecasting and proper understanding of the future changes is necessary for planning in health-care area.One of the problems of analytic methods is determination of the number and location of the breakpoints, especially for fluctuation data. In this area, few researches are published when number and location of the nodes are not specified.In this paper, a clustering-based method is develo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011